Statistical estimation of unreliable features for robust speech recognition

Authors

Philippe Renevey

Andrzej Drygajlo

Abstract

This paper addresses the problem of robust speech recognition in noisy conditions within the framework of hidden Markov models (HMMs) and missing feature techniques. It presents a new statistical approach to the detection and estimation of unreliable features, based on a probabilistic measure and a Gaussian mixture model (GMM). In the estimation process, the GMM is compensated using the parameters of a statistical model of the additive background noise, and the GMM means are used to replace the unreliable features. The GMM-based technique is less complex than the corresponding HMM-based estimation and gives a similar improvement in recognition performance. Once the unreliable features have been replaced by the estimated clean speech features, the entire set of spectral features can be transformed to another feature domain with a higher baseline recognition rate (e.g., MFCCs) for final recognition using continuous density hidden Markov models (CDHMMs) with diagonal covariance matrices.
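The replacement step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: the reliability mask is given rather than detected, the GMM has diagonal covariances, the noise compensation of the GMM is omitted, and each unreliable component is replaced by the posterior-weighted average of the component means, with posteriors computed from the reliable components only. The function name `impute_unreliable` and all parameter names are hypothetical.

```python
import numpy as np

def impute_unreliable(x, reliable, weights, means, variances):
    """Replace unreliable entries of x with posterior-weighted GMM means.

    x         : (D,) spectral feature vector (assumed log-spectral)
    reliable  : (D,) boolean mask, True where the feature is trusted
    weights   : (K,) mixture weights of a diagonal-covariance GMM
    means     : (K, D) component means
    variances : (K, D) component variances
    """
    x = np.asarray(x, dtype=float)
    reliable = np.asarray(reliable, dtype=bool)
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)

    # With diagonal covariances, marginalising over the unreliable
    # dimensions amounts to dropping them from the likelihood.
    diff = x[reliable] - means[:, reliable]
    var_r = variances[:, reliable]
    log_lik = -0.5 * np.sum(diff**2 / var_r + np.log(2.0 * np.pi * var_r), axis=1)

    # Component posteriors, normalised in the log domain for stability.
    log_post = np.log(weights) + log_lik
    log_post -= log_post.max()
    post = np.exp(log_post)
    post /= post.sum()

    # Assumed estimator: posterior-weighted average of the component means.
    estimate = post @ means

    # Keep reliable features, substitute the estimate elsewhere.
    return np.where(reliable, x, estimate), post


# Toy usage: two well-separated components, second feature unreliable.
weights = np.array([0.5, 0.5])
means = np.array([[0.0, 0.0], [10.0, 10.0]])
variances = np.ones((2, 2))
x_hat, post = impute_unreliable([10.0, 3.0], [True, False], weights, means, variances)
```

In this toy case the reliable first feature sits on the second component's mean, so the unreliable second feature is pulled to that component's mean. A full system would first compensate the GMM for the noise and derive the mask from a reliability measure, as the abstract outlines.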

Similar resources

A new method for robust speech recognition based on missing data using a bidirectional neural network

The performance of speech recognition systems is greatly reduced when speech is corrupted by noise. Missing feature methods are one common approach to robust speech recognition. In these methods, the components of the time-frequency representation of the signal (the spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing, deleted, and then replaced using the remaining components and statistical ...

HMM-based estimation of unreliable spectral components for noise robust speech recognition

This paper presents a novel approach for reconstructing unreliable spectral components, which utilizes HMM-based missing feature algorithms and applies them to noise robust speech recognition. The proposed technique uses the forward-backward algorithm to estimate corrupt spectrographic data based on nearby reliable features, noisy observations, and an underlying statistical model. The estima...

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have shown their effectiveness in speech recognition systems for feature extraction and acoustic modeling. In addition, CNNs have been used for robust speech recognition, and competitive results have been reported. A Convolutive Bottleneck Network (CBN) is a kind of CNN that has a bottleneck layer among its fully connected layers. The bottleneck fea...

Detection of Reliable Features for Speech Recognition in Noisy Conditions Using a Statistical Criterion

This paper addresses the problem of integration of missing data theory in the context of robust speech recognition in additive noise. It shows that techniques based on statistical estimation and thresholding of a posteriori signal-to-noise ratio (SNR) can be used for the detection of reliable (not much affected by noise) features as opposed to unreliable or missing (masked by noise) features. I...

Robust speech recognition using missing feature theory in the cepstral or LDA domain

When applying Missing Feature Theory to noise robust speech recognition, spectral features are labeled as either reliable or unreliable in the time-frequency plane. The acoustic model evaluation of the unreliable features is modified to express that their clean values are unknown or confined within bounds. Classically, MFT requires an assumption of statistical independence in the spectral domai...

Publication date: 2000